Supporting Relational Knowledge Discovery: Lessons in Architecture and Algorithm Design
نویسندگان
چکیده
This paper discusses a few of the lessons we have learned developing a relational knowledge discovery system. The relationships among data instances in relational data provide extra information for “mining.” This additional information has the potential to greatly improve the quality of learned models. However, the dependencies among instances in the data also introduce new statistical challenges for learning algorithms. Relational data provide an ideal environment in which to examine a central challenge of knowledge discovery – its “chicken and egg” character. Data representation can impair the ability to learn important knowledge, but knowing the “right” data representation often requires just that knowledge. With relational data, representation is often a choice; many alternate views of the data provide abundant fodder for reasoning about transformations. In light of this, we discuss representation and design choices that support a co-evolutionary process of knowledge discovery and data transformation in relation data.
منابع مشابه
Drug Discovery Acceleration Using Digital Microfluidic Biochip Architecture and Computer-aided-design Flow
A Digital Microfluidic Biochip (DMFB) offers a promising platform for medical diagnostics, DNA sequencing, Polymerase Chain Reaction (PCR), and drug discovery and development. Conventional Drug discovery procedures require timely and costly manned experiments with a high degree of human errors with no guarantee of success. On the other hand, DMFB can be a great solution for miniaturization, int...
متن کاملCluster Based Cross Layer Intelligent Service Discovery for Mobile Ad-Hoc Networks
The ability to discover services in Mobile Ad hoc Network (MANET) is a major prerequisite. Cluster basedcross layer intelligent service discovery for MANET (CBISD) is cluster based architecture, caching ofsemantic details of services and intelligent forwarding using network layer mechanisms. The cluster basedarchitecture using semantic knowledge provides scalability and accuracy. Also, the mini...
متن کاملPrototype a Knowledge Discovery Infrastructure by Implementing Relational Grid Monitoring Architecture (R-GMA) on European Data Grid (EDG)
This paper describes the implementation of a ScanOnce algorithm in SQL for quick association rule mining and the development of a data mining infrastructure JetGrid. The architecture of JetGrid is designed to be compatible with lower-level grid mechanisms since it is to operate on top of Relational Grid Monitoring Architecture (R-GMA) provided by European Data Grid (EDG). JetGrid for quick know...
متن کاملUsing Clouds for Scalable Knowledge Discovery Applications
Cloud platforms provide scalable processing and data storage and access services that can be exploited for implementing highperformance knowledge discovery systems and applications. This paper discusses the use of Clouds for the development of scalable distributed knowledge discovery applications. Service-oriented knowledge discovery concepts are introduced, and a framework for supporting highp...
متن کاملOPTIMAL DESIGN OF JACKET SUPPORTING STRUCTURES FOR OFFSHORE WIND TURBINES USING ENHANCED COLLIDING BODIES OPTIMIZATION ALGORITHM
Structural optimization of offshore wind turbine structures has become an important issue in the past years due to the noticeable developments in offshore wind industry. However, considering the offshore wind turbines’ size and environment, this task is outstandingly difficult. To overcome this barrier, in this paper, a metaheuristic algorithm called Enhanced Colliding Bodies Optimization...
متن کامل